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A  ROBUST  CRAM HR- RAO  ANALOGUE 


Gabriela  Stangenhaus  and  H.  T.  David 


1,  INTRODUCTION  AND  SUMMARY 

We  give  an  analogue  of  the  usual  uni-parameter  Cramer-Rao  develop¬ 
ment,  in  which  unbiasedness  is  replaced  by  median-unbiasedness,  variance 
is  replaced  by  a  dispersion  measure  first  proposed  by  Alamo  (1964),  which 
we  call  local  kurtosis,  and  the  information  in  the  sample  is  computed  in 
terms  of  the  first  absolute  moment  of  the  sample  score,  rather  than  the 
second  moment. 

Given  a  sample  x  ,  with  density  f(_x;0),  consider  statistics  that 
are  median-unbiased  for  0  ;  i.e.,  statistics  6  such  that 

/  f (5c;0)d.x  =  f  f(_x;0)d2£  =  h  •  (l.i) 

x :  6  (x)  <_  0  x:  ^  00  2l  6 


Given  also  the  density  g^(«;0)  of  6,  define  K(6;0) ,  the  local  kurtosis 
of  ^  ,  by 


K(6;0)  »  (2gs(O;0)) 


(1.2) 


We  show  below  that,  in  the  "regular"  case,  (1.1)  implies  the  (L^  Cramer- 
Rao)  inequality 

K(«5;0)  >  <Ij (0))_1 .  (1.3) 

where  1^  Is  an  analogue  of  Fisher's  Information: 


ip'J) 


f  I70  ln 


f (x;0)dx. 


(1.4) 
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(1.3)  being  equally  valid  when  the  derivative  in  (1.4)  is  only  a  left 
or  a  right  derivative. 

A  "not-necessarily-regular"  version  of  (1.3);  i.e.,  a  Chapman- 
Robbins  analogue  (1951,  relation  (5)),  Is: 


lim 


K(4’w  ±  (1hFo  ' 


f(x;Q+h)  -  f(x;0) 

^  hf(x;0)  ^ 


f  (x;0)dx) 


(1.5) 


A  necessary  and  sufficient  condition  for  equality  in  (1.3)  is: 

/  In  f  (>c;0)  j  f  (x;6)dx  =  |  /  In  f(>c;Q))  f(}c;0)dxj  (1.6a) 

>c:S(x)  <_  0  x_:&(2i)  1  ^  ;  J 


and 

f  In  f  (x;0)  |f  (jc;0)dx  »  j  /  In  f  (x ;0) )  f(x;0)dxj,  (1.6b) 
x: 6 (x)  ^  0  3c:6(x)  0 

and  a  family  of  pairs  (f,6)  meeting  (1.1)  and  (1,6)  consists  of  the 
densities 

fL(x;0)  =  C  exp  L (x-0) ,  -  oo  <  x  <  oo  §  (1,7) 


where  0  =  R  and  L(-)  is  symmetric,  satisfying  certain  smoothness 
conditions  (cf.  section  4),  the  associated  statistics  <5  being  the 
maximum- likelihood  estimates  6  (x) : 

L  _ni 

£  L(x  -  (x))  =  sup  £  L(x  -  0).  (1.8) 

i  1  L  0  1  1 

K{£;0)  in  (1.2)  is  a  local  and  scalar  version  of  the  ’’risk  curve” 
a(*,0,6),  as  defined  by  relation  (1.1)  in  Birnbaum  (1961).  The  curve 
a(*,0,O,  monotone  increasing  to  the  left  of  0  and  monotone  de- 
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creasing  to  the  right,  measures  tail  size  of  the  distribution  of  6 
with  respect  to  0  ,  and  the  dispersion  of  6  about  0  is  measured  in 
non-scalar  qualitative  fashion  by  the  elevation  of  this  curve.  When  6 
is  median-unbiased  and  possesses  a  density  gg(*;0),  then  a(O,0,6)  =  % 
and  a'(0-,0,6)  =  |a,(0+1G,6)|  *  g^(O,0),  so  that  K(6;0)  is  indeed  a 
natural  scalar  summarization  of  the  elevation  of  a(',0,6). 

Alamo  (1964)  proposed  K(6;0)  in  connection  with  a  Cramer-Rao 
analogue  for  median-unbiased  estimation,  but  failed  to  replace  the 
Fisherian  bound  based  on  I^C®)  with  the  sharper  one  based  on  1^(0), 
and  thus  did  not  achieve  the  full  potential  of  his  invention.  Relations 
(1.3)  and  (1.3)  are  given  in  Stangenhaus  and  David  (1979),  but  under 
restrictive  regularity  assumptions  arising  from  deriving  (1.3)  and  (1.5) 
as  limits  of  analogous  relations,  p  >  1.  We  do  owe  our  use  of  the 

term  "local  kurtosis"  to  the  development  in  that  paper,  in  that  ordinary 
kurtosis  does  turn  out,  essentially,  to  be  the  natural  measure  of 
dispersion  for  the  case  p  =  4. 

Modifications  1^,  r  >  1,  of  Fisher's  information  appear  for 

example  in  Barankin's  study  of  minimum  s-norm  unbiased  estimation, 

1/s  +  1/r  -  1,  while  1^  itself  appears  not  to  have  been  suggested 
heretofore.  It  must  be  noted,  however,  that  all  departures 
r  y  2,  from  Fisher's  may  in  fact  be  problemat leal  qua  informat  ion . 

None  for  example  is,  as  required  by  Schutzenberger ' s  (1951)  axioms  for 
information  measures,  the  expectation  of  a  linear  functional  of  In  f. 

Though  the  present  point  of  view  appears  to  be  somewhat  distinct, 
our  conclusions  (or  at  least  some  of  their  specializations)  do  accord 
with  historical  precedent.  With  regard  to  our  Example  2.1  for  example, 
Birnbaum  (1961)  has  identified  the  sample  mid-range  as  "admissible*' 
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median-unbiased  in  the  case  of  samples  of  size  two  from  the  uniform 
distribution.  Again,  with  regard  to  our  Example  4.1,  Laplace  is 
credited  in  Birnbaum  (1964)  with  having  identified  the  sample  mean 
as  minimum-expected-absolute-deviation  among  median-unbiased  estimates, 
in  the  case  of  normal  samples. 

Each  facet  of  our  development,  be  it  the  median-unbiasedness  or 
local  kurtosis  of  6,  or  the  information  1^,  exhibits  its  own  type 
of  insensitivity  to  tail  behavior,  the  tail  in  question,  in  the  case 
of  I^,  being  that  of  the  distribution  of  the  sample  score  In  f(X;0). 

Thus  our  presentation  seems  properly  viewed  in  the  light  of  the  broad 
area  of  robust  estimation  (Huber,  1981). 

2.  A  LOWER  BOUND  FOR  THE  LOCAL  KURTOSIS 
OF  MEDIAN-UNBIASED  ESTIMATES 

We  consider  an  n-dimensional  sample  X  =  (X. , . . , ,X  )  with  a 

—  I  n 

density  f(x^0)  over  RU,  0  belonging  to  an  open  interval  0  of 
R,  and  a  statistic  6(X)  =  Y  such  that,  for  0 t  0  , 

Y  possesses  a  density  g(y;0)  over  R  ,  (2.1) 

For  such  one  has,  for  0f,  0"  e  0  , 

1(0’, 0")  -  f°_l  g6(y;0")dy  -  f  f(x;0")dx  -  <fc(O\0").  (2.2' 

_x:6(x)  <_  O’ 

and  analogously  for  the  corresponding  "upper-tail"  functions  y(0,,0") 
and  0(0', 0"). 

When  <5  is  median-unbiased  for  0,  one  has  as  well  (cf.  (1.1)), 

for  0cO  » 

1(0.0)  -  1(0.0)  -  7(0.0)  *  7(0,0)  -  ,  (2.3) 
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and,  for  0  and  positive  h  such  that  both  0  and  0  +  h  are  in  0 , 
(2.2)  and  (2.3)  imply  th^t 

/g+h  g6(y;0+h)dy  =  2(O+h,0+h)  -  1(0, G+h) 

»£(0+h,0+h)  -£(0,0+h)  »  £(0,0)  -  £(0,0+h)  (2.4) 

»  I  [f(x;0)  -  f(x;0+h)]djt  , 
x :  6  (x)  _<  0 

where  the  first,  second  and  fourth  equalities  are  due  to  (2.2),  and  the 
third  to  (2.3).  * 

Suppose  now  in  addition  that,  for  OeO  ,  g^(y;0)  is  continuous 
in  y  ,  in  a  neighborhood 

N0  of  y  -  0,  Ne  e:  0  (2.3) 

Then  for  positive  h  such  that  0  +  h  c  ,  the  law  of  the  mean  gives, 
for  0  €  0  , 

0+h 

/e  g6(y;0+h)dy  =  h.g6(0+h.Ah;  0+h)  (2.6) 

for  the  LHS  of  (2.4),  where  0  _<  A^  _<  1 ,  and,  dividing  (2.4)  by  h 
and  taking  absolute  values,  one  finds 

g(0+h-Xh;  0+h)  £  /  |f(x;0+h)  -  f(x;0)/h|dx  .  (2.7) 

2c:6(x)  £  0 

and  (2.7)  is  seen  to  be  equally  valid  for  negative  h  such  that 
0  +  h  r  N0  . 

An  analogous  argument  using  >  and  t/)  leads  as  well  to 
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g(0+h«A^;  0+h)  £  /  |f(x;0+h)  -  f  ( jc ; 0 )  | d?c  (2.8) 

jc:6(jc)  _>  0 

and,  adding  (2.8)  to  (2.7),  one  concludes  that 

2g(0+h.Ah;  0+h)  1  f  |f(x;0+h)  -  f(x;0)/h|dx.  (2.9) 

When  in  addition,  for  0eO  , 

( • ; • )  is  continuous  at  (0,0),  (2.10) 

relation  (2.9)  yields  relation  (1.5). 

We  shall  say  that  a  pair  (f,6)  satisfying  regularity  conditions 
(2.1),  (2.5)  and  (2.10),  and  also  (1.1)  plus  (1.5)  with  equality,  is 
-optimal . 

Example  2.1.  Consider  f  *  f^(x;0)  ,  equal  to  unity  for  Q-*$£x_<0+^, 
and  to  zero  otherwise,  and  also  6  the  sample  mid-range  (smr) .  The  pair 
(f^,  smr)  is  L^-optimal: 

To  begin  with  (David,  1981;  problem  2.3.5(b)), 

gsmr^y;0^  =  nO-2|y-0|)n  1,  Q-h±y<Q  +  h, 

so  that  (f^,  smr)  satisfies  (1.1),  (2.1),  (2.5)  and  (2.10),  and  also 

K(smr ; 0)  =  (2g  (0 ; 0) )  1  =  l/2n  . 

smr 

It  therefore  remains  to  show  that  the  RHS  of  (1.5)  equals  1 /2n  .  But, 
writing  the  integrand  in  the  RHS  of  (1.5)  as  |h|  ^|f(jc;0+h)  -  f(x.;0)j, 
we  see  that  the  integral  in  the  RHS  of  (1.5)  is  the  content  of  the 
symmetric  difference  of  two  unit  n-cubes  displaced  with  respect  to  each 
other  by  an  amount  /n  h  along  the  equiangular  line.  This  content  is 
2n  h  to  order  h  ,  so  that  the  RHS  of  (1.5)  is  indeed  l/2n  . 
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3.  THE  REGULAR  CASE 

Consistently  with  precedent,  we  identify  the  regular  case  as  that 
for  which,  when  0cO, 

|-j|t  In  f  (_x;0')  I  £  C0(x) ;  O'  e  Ny,  Rn,  (3.1a) 


where 


f  G^OOf  (x;G)dx  <  +  °°,  0  e  0  , 


(3.1b) 


and  where  the  derivative  may  be  construed  as  one-sided,  say  as  a  right 
derivative.  In  that  case  the  RHS  of  (1.5)  Is  bounded  below  by 


(  lira  / 
h+0+ 


f(x;0+h)  -  f(x;G) 
I  hf (xiO) 


f (x;0)dx) 


(/ 1  lim  (- 


f(x;0+h)  -  f (x;0) 


h>0+ 


hf (x;0) 


)|f(x;0)dx)_1  =  (IjO))"1, 


and  relation  (1.5)  is  seen  to  imply  relation  (1.3)  under  (3.1). 
To  study  achievability  for  (1.3),  we  write  it  in  the  form 


K(6;0)  >_  (/ In  f  (_x;0)  |  f  U;0)d^  +  /|^  In  f  (x;0)  |  f  Oc;G)dx)  1 
x:S(x)  >  0 


1  60 

x:6(x)  <  0 


(3.2) 


Now  divide  (2,4)  by  h,  as  was  done  in  obtaining  (2.7),  and  let  h 
tend  to  zero  under  assumption  (3.1),  but  without  absolute  values  taken. 
We  find 


gfi(O,0)  »  |/(^  In  f  (x;0))f  (j£ ;  0 )  d  |  , 
x:6(x)  <  0 


and,  correspondingly, 
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g6(M)  -  In  f(x;O))f(x;0)dx|; 

jc:<5(x)  >.  0 

in  other  words, 

2g6(0,e)  =  |/(-^  In  f(x;0))f(x;0)dx| 
x:  6  (x_)  £  0 

+  In  f(x;0))f(x;G)dx|.  (3.3) 

x:  6(x)  21  0 

Relations  (3,2)  and  (3,3)  make  clear  that,  indeed,  conditions  (1,6)  are 
necessary  and  sufficient  for  equality  in  (1.3),  given  conditions  (2.1), 
(2.5),  (2.10)  and  (3.1). 

We  shall  say  that  a  pair  (f,<S)  satisfying  regularity  conditions 
(2.1),  (2.5),  (2.10)  and  (3.1),  and  also  (1.1)  and  (1.6),  is  regular 
Lj -optimal . 

Example  3.1.  Consider  f  *  f^(x;0)  =  S  exp  -  |x-0 1  ,  -  °°  <  x  <  +  00  ,  and 
also  5  the  median  (med)  of  a  sample  of  odd  size  n  .  The  pair  (f^,,  med) 
is  regular  L^-optimal: 

To  begin  with,  it  is  seen  by  inspection  of  (Johnson 

and  Kotz,  1970;  p.  25)  that  (f^.,  med)  satisfies  (1.1),  (2.1),  (2.5) 
and  (2.10).  In  addition. 

In  f^(5c;0)  =  [//  of  x^’s  >  0]  -  [//  of  x^ 1  s  _<  0],  (3.4) 

and  (3.4)  is  of  one  sign  both  in  the  region  med  jc  0  and  in  the  region 

med  >  0,  so  that  (1.6)  is  true.  Moreover  (3.4)  implies  that 

\to  £  nt  so  that  (3.1)  is  seen  to  hold  with  G^(x)  =  n  . 


9 


4.  AN  Lx  EXPONENTIAL  FAMILY 

As  indicated  in  section  1,  the  pair  (f  ,6  )  in  (1.7)  and  (1.8) 

L*  Li 

may  be  shown  to  be  regular  L^-optlmal  under  certain  smoothness  conditions 
on  the  symmetric  function  L().  To  begin  with,  if  (1.8)  is  to  define 
uniquely,  it  must  be  that 

L(*)  is  strictly  concave  on  R.  (4.1) 

Next,  if  (1.8)  is  to  reduce  to  the  analytically  tractable  condition 

T.L'(x±-&l(x))  -  0,  (4.2) 

it  must  be  that 

L(*)  possesses  a  derivative  Lf(*)  on  R.  (4.3) 

Conditions  (4.1)  and  (4.3)  together  of  course  also  imply  that 

£Lf(Xj-0)  is  strictly  increasing  in  0,  (4.4) 

and  are  in  fact  sufficient  to  insure  (1.1)  and  (1.6),  as  is  now 
demonstrated  in  Lemmas  (4.2)  and  (4.3). 

Lemma  4.1.  Under  conditions  (4.1)  and  (4.3),  the  distribution  of  ^ 
is  symmetric  about  0  . 

Proof .  Under  the  stated  conditions  and  their  implications  (4.2)  and 
(4.4),  Pr{,SL(X)  <  6  +  e) 

=  Pr{LL'(Xi  -  0  -  t)  >  0}  =  Pri-EL^X,  -  0  -  e)  <  0} 

=  Pr{j]L’  (-(X  -  0)  +  e)  <  0}  =  Pr{XL'(X  -  0  +  c)  <  0) 

=  Pr{EL'(X  -  (0  -  £.))  <  0}  =  Pr{iS  (X)  >  0  -  eh 

Lemma  4.2.  Under  conditions  (4.1)  and  (4.3),  <5,  is  median-unbiased 

*  L 


for  0  (i.e.f  (1.1)  holds). 


10 


Proof .  By  Lemma  4.1. 

Lemma  4.3.  Under  conditions  (4.1)  and  (4.3),  (f  ,6  )  satisfies  (1.6). 

L  1  j 

Proof .  Since 

•gjj  In  f (x;0)  =  -  EL'^-Q),  (4.5) 

we  find,  analogously  to  the  argument  in  the  proof  of  Lemma  4.1,  that, 
under  conditions  (4.1)  and  (4.3),  and  their  implications  (4.2)  and 
(4.4), 

■k iu  f(i;M  <  °  -  {L(i)  <  »  • 

Hence  In  f(x.;0)  is  of  one  sign  both  when  6^  (*')  0  and  when 

(jO  6  ,  which  establishes  (1.6). 

Verifying  condition  (3.1)  requires  one  further  assumption;  namely, 
that  there  exist  K(e)  such  that,  for  hr  (~l,l)  and  x  f  R, 

| L ’ (x-h) |  <  K(r)  •  | L * (x) | .  (4.6) 

Lemma  4.4.  Condition  (3.1)  is  met  under  (4.1),  (4.3)  and  (4.6). 

Proof .  Given  assumption  (4.6),  for  O’  =  U  +  h,  hr  (-r,0,  and  xtR, 

| L ’ (x-0 • ) I  =  |l'((x-0)  -  h) | 

_<  K(e)  •  j  L  ’  (x-0)  |  =  g(j  c(x),  (4.7) 

and 


fg{)  c(x)  fL(x;0)dx  = 

C-K(c)[/“|L,(x-0)!eL(x"°)dx  +  /JjL*  (x-U)  |  eL(x_0)dx] 


(4.8) 


It  follows  that  the  function 


G0,e(^  ’  I'80,c(xi) 


satisfies  (3.1),  since,  by  (A. 7),  for  0*  t  (G-c,  0+e) , 


\jqT  In  fL(x;0’)|  1  S | L * (x1~0,> 


-  Xg0,c(Ki)  “  G0,r.(^* 


and  also,  by  (A. 8), 


■fGetC(x)  fL(x;P)dx  =  yjgQ  ^(xp  fi/xi;0* 


=  2nC  eL^°')K(£)  . 


It  remains  to  verify  (2.1),  (2.5)  and  (2.10).  Unfortunately  it 
seems  difficult  to  adduce  tractable  conditions  on  L(-)  insuring  (2.1) 
that  are  not  at  the  same  time  unreasonably  restrictive.  Thus  we  simply 
add  (2.1)  itself  to  the  list  of  requirements  for  L(*)s 

!*(•)  is  such  that  6  possesses  a  density  g-  .  (A. 9) 

L  \ 

Given  (A. 9),  it  is  however  possible  to  condense  (2.5)  and  (2.10)  into 
a  single  condition,  through  the  observation  that  0  is  a  location 
parameter  for  the  distribution  of  6  .  This  last  is  seen  by  adding 

and  subtracting  0  inside  the  main  brackets  of  (A. 2),  whereupon  it 
follows  that  6^(X)  ~  0  is  a  function  only  of  the  parameter-free 
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% 


quantities  (X^O) .  At  any  rate  the  density  of  (4.9)  may  now  be 

L 

written 

gx  (y;0)  ■  g(y-O)*  (4.10) 

L 

so  that  conditions  (2.5)  and  (2.10)  are  seen  to  reduce  to  the  single 
condition 


g  In  (4.10)  is  continuous  in  a  neighborhood  of  0.  (4.11) 

In  summary,  then,  conditions  on  L(*)  insuring  that  the  pair 
(f L» <5^)  in  (1.7)  and  (1.8)  is  regular  L^-optimal  are  (4.1),  (4.3), 

(4.6),  (4.9)  and  (4.11). 

Example  4.1.  (N(0,1),  X)  is  regular  L^-optimal. 
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